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The observed angular correlation function of the cosmic microwave background has previously 
been reported to be anomalous, particularly when measured in regions of the sky uncontaminated 
by Galactic emission. Recent work by Efstathiou et al. presents a Bayesian comparison of isotropic 
theories, casting doubt on the significance of the purported anomaly. We extend this analysis to all 
anisotropic Gaussian theories with vanishing mean {{ST) — 0), using the much wider class of models 
to confirm that the anomaly is not likely to point to new physics. On the other hand if there is 
any new physics to be gleaned, it results from low-^ alignments which will be better quantified by 
a full-sky statistic. 

We also consider quadratic maximum likelihood power spectrum estimators that are constructed 
assuming isotropy. The underlying assumptions are therefore false if the ensemble is anisotropic. 
Nonetheless we demonstrate that, for theories compatible with the observed sky, these estimators 
(while no longer optimal) remain statistically superior to pseudo-C^ power spectrum estimators. 



I. INTRODUCTION 

Observations of the cosmic microwave background 
(CMB) by the Wilkinson Microwave Anisotropy Probe 
(WMAP; e.g. [Il[2]) are widely interpreted as confirm- 
ing the standard model of cosmology in which inflation 
generates a homogeneous and isotropic background and 
seeds isotropic, nearly scale-free perturbations. Yet a va- 
riety of tests suggest that, on large scales, something may 
be amiss [3Hl0] . (For a wide-ranging assessment of such 
anomalies in the 7-yr WMAP data see Ref. [TT]-) The 
interpretation of these results is complicated by the a 
posteriori nature of anomaly hunting; any large dataset 
will contain statistical flukes which, in isolation, can be 
made to look unacceptable. This is a particularly per- 
nicious problem in the context of large-scale cosmology: 
with only one sky to observe, frequentist statistics are 
almost impossible to interpret. 

Frequentist results can be made into more concrete 
Bayesian statements by considering specific alternative 
CMB theories or classes of theories (see e.g. Refs. [T^ 
[Ti]). But a single, fixed dataset can still contribute over- 
whelming evidence in favour of or against the very same 
theory, depending on the alternatives against which we 
are judging (for an elucidation of this point, see Ref. [13], 
Sec. 5.5). In other words there is no unique way to as- 
cribe significance to departures from the standard theory. 

This does not imply we should abandon critical eval- 
uations of WMAP and other data: if we simply accept 
we have an 'unlikely' realization of our favoured theory, 
we might miss the opportunity to discover new physics 
(or instrumental systematics) . Thus frequentist results 
cannot be dismissed out-of-hand; but we would advo- 



cate their interpretation as pointers to interesting areas 
of work, rather than quantifiable death-knells of existing 
models or theories. 

In the present work, we will consider a long-standing 
debate about the nature of the angular correlation func- 
tion C{9) of the CMB. The argument is usually phrased 
in terms of the statistic Sy^^ which traces the extent 
to which temperature fluctuations (outside a Galactic 
mask) are correlated between points separated by 60° 
or more. For a quantitative definition, see Section [IT] 
A number of recent works have attempted to assess the 
significance of the purportedly anomalous value of S^^^^ 
reaching essentially contradictory conclusions. In partic- 
ular, the frequentist P-value [10 suggests the observed 
sky is highly anomalous, while a Bayesian analysis of the 
optimally reconstructed sky by Efstathiou et al. sug- 
gests the opposite [IS]; see also Ref. [TT]. However, any 
Bayesian result pivots crucially on the alternative mod- 
els considered; the assumptions in Ref. [TS] mean that 
only isotropic models are considered. This is a significant 
omission, since it leaves open the possibility that subop- 
timal estimates of S1/2 formed from cut sky data can be 
reframed as useful measures of anisotropy. 

The present work rectifies that omission. The anomaly 
is analysed from within harmonic space, and then 
anisotropic theories which make our CMB realization 
more probable are considered. The anomaly is found 
to be uninformative in the following two senses: 

1. The trivial maximum likelihood anisotropic Gaus- 
sian theory for our observed sky^ does not lead to 
substantially better likelihoods for the single statis- 
tic S^'^l; 
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^ Namely, that with covariance matrix C 
observed sky data vector. 
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2. Theories constructed specifically to maximize the 
likelihood of (ignoring the rest of the informa- 
tion on the sky) also yield little gain. 



These failures arise from the large variance inherent in us- 
ing a statistic, such as j which is quartic in the data. 
Overall, then, the present work reinforces the view that 
the frequentist 'unlikeliness' of the observed sky must be 
regarded as a statistical fluke. 

Some broader results arise from our study. First, we 
consider the effect of an anisotropic theory on quadratic 
maximum likelihood (QML) estimates of the power spec- 
trum. The QML estimators are derived under the (in 
this context false) assumption of isotropy; despite this, 
they typically remain superior to pseudo-C^ approaches 
to power spectrum estimation (Section |llj with detail in 
Appendix A 5). Second, we present an extremely fast 



method for finding the maximum angular momentum di- 
rection of a CMB map (Appendix |e]). Third, we demon- 
strate that cut-sky correlation functions can be exactly 
reproduced from the pseudo-C^ power spectrum (Ap- 
pendix |b|. This final result, applicable also for weighted 
data, has been reported previously I17| but ignored by 
recent work; to our knowledge no proof appears in the 
existing literature. 

The paper is structured as follows. Section [ll] intro- 
duces the necessary background and notation. In Sec- 
tion |lll] we consider, from a harmonic-space perspective, 
the origin of the low observed Sy^- Anisotropic, Gaus- 
sian theories which reproduce this result are considered 
in Section [rv] and show that even the best conceivable fit 
to the observed CMB makes no substantial improvement 
to the Sy^ likelihood. Finally, the work is summarized in 
Section |Vl Further details and discussion are contained 
in appendices. 



II. BACKGROUND AND NOTATION 

In this Section we set out the various definitions needed 
in our work. Let us start by defining the observed tem- 
perature correlation function C{9) as 



C{0)=Tih,)T{h2) 



(1) 



where the overbar denotes averaging over all observed 
line-of-sight vector pairs ni, 62 satisfying ni • n2 = cos 9. 
We further define 
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where the aim's are the spherical harmonic coefficients of 
the temperature field on the observed sky. A calligraphic 
Ci thus denotes the observed power, distinguished from 
the theoretical variances Cg which we regard as defined 
by the relation 



Ci = id) 



(3) 



for isotropic and anisotropic theories alike. 

There is an exact relationship between C{6) and Ce, 
namely 



C{e) = ^'^{2£+l)CePi{cost 



(4) 



where Pi are the Legendre polynomials. Throughout this 
paper in our numerical calculations, we use a finite sum 
over 2 < i < 30; the lower limit discards any contribution 
from the monopole and dipole, while we verified that the 
upper limit is high enough for our results to converge. 

Result Q holds regardless of any theoretical con- 
straints (such as isotropy). Thus the information in the 
correlation function is identical to that in the C^'s ob- 
served on the sky. If desired, one can define the theoreti- 
cal correlation function to be the ensemble average of the 
sky-observed correlation function, C{9) = {C{9)). 

The purported anomalies relate to the apparent lack 
of correlations on large angular scales, quantified by 



/•1/2 

Si/2 = J C{9fsm9d9. 



(5) 



This quantity is a measure of the extent to which the tem- 
perature from points separated by 60° or more is corre- 
lated. Rather than evaluate ([s]) directly, it is much faster 
and numerically more stable to calculate S1/2 from the 
quadratic form 



Sl/2 — E ^^^^ 



(6) 



where, as above, £ and i' range from 2 to 30 in our nu- 
merical calculations and 



1/2 



Pi{x)Pi,{x)dx, 



(7) 



which may be computed using well-known recursion re- 
lations {e.g. Appendix C.2 of Ref. [H]; see also Ref. 

If one does not trust information inside a specified 
mask (for instance due to suspected Galactic contami- 
nation) , one may calculate the correlation function using 
only the points outside the mask. 



T(ni)T(n2) 



Af(ni)=A/(n2) = l 



(8) 



where M{n) is a masking function (equal to or 1 in each 
pixel) , so that the angular average denoted by the overbar 
is over all point pairs (with fii • n2 = cos 9) which lie out- 
side the mask. This procedure is mathematically identi- 
cal to calculating the Legendre sum over the pseudo-C^ 
(PCL) estimates for the power spectrum (which we de- 
note Cf''-"^; see Appendix [a| for a precise definition): 



+ l)Cj^^Pi{cost 



(9) 
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While this result has been reported before [T7], an ex- 
plicit proof does not appear to exist in the literature, so 
we provide one in Appendix \B\ 

It is clear that, if Ci are any unbiased estimates for 
Ci, then forming their Legendre sum Q yields an un- 
biased estimator for C{9). Thus if one wishes to find 
maximum likelihood estimates for C{9) on the full sky 
from cut sky information, by linearity one simply substi- 
tutes the maximum likelihood Cg estimates in place of the 
PCL estimates. For our purposes, the estimates provided 
by the quadratic maximum likelihood (QML) estimator 
technique [19j are close enough to the exact maximum 
likelihood to remove the need for any non-linear tech- 
niques unj- 

There are two somewhat subtle points to be appreci- 
ated here. First, reservations have been expressed about 
the use of the QML estimator, since it uses prior infor- 
mation on the power spectrum and therefore appears to 
make strong assumptions about the form of the underly- 
ing theory. Copi et al. [TUl express concern about use of 
the QML estimator in circumstances where one is ques- 
tioning the validity of the concordance model. In fact, 
this unease turns out to be unwarranted; one may explic- 
itly show that QML estimates remain superior to PCL 
estimates ~ even in cases where the estimates for the co- 
variance matrix are not correct. 

Let us outline why this should be so. A full derivation 
is given in Appendix |A 5[ in which the QML estimator 
is written entirely within harmonic space. The estima- 
tion procedure can be seen to down-weight high variance 
modes and up-weight low variance modes before calcu- 
lating the power spectrum of the weighted cut sky. The 
resulting power spectrum is then correctly de-weighted 
and de-convolved^. The overall effect is to minimize the 
cross-talk from the mask-induced mode-coupling. Ac- 
cordingly the QML estimator will be close to optimal in 
reconstructing skies from any theory where the power in 
each ^ is close to that predicted in concordance models. 
Since the observed power spectrum is very close to the 
ACDM theoretical prediction, any serious candidate the- 
ory must satisfy this criterion. Indeed, only if the true 
theory has a power spectrum closer to flat than to the 
concordance model will the PCL estimator typically per- 
form better than the QML. A corollary is that the PCL 
and QML estimators become identical for a theory with 
flat power spectrum'^ (equal Cgs). 

Efstathiou et al [TB] make a different rebuttal of the 
sensitivity-to-assumptions concern by showing that full- 



^ This interpretation of the QML estimator's operation is a 
harmonic-space equivalent to the pixel-space 'high-pass filter' in- 
terpretation given by Tegmark . 

^ Closely related to this is the better-known result that PCL es- 
timators for high I'a are nearly optimal for moderate sky cuts. 
Such cuts imply that the reconstruction is only sensitive to a fi- 
nite window over which the high-£ ACDM power spectrum 
is nearly flat. 



sky maps (albeit band-limited) can be made from the cut 
sky data (see also Ref. The difficulty with full-sky 

reconstructions is that they rapidly become unstable as 
the sky cut increases (unless the data are strictly band- 
limited, which is not true of the CMB). Furthermore they 
introduce a dependence on the underlying theory so that 
their conceptual benefits over the QML estimator are not 
clear (although, as shown in Ref. Ji3_, the sensitivity to 
the assumed covariance may be rather weak). For this 
reason we will not consider explicit sky reconstructions 
further in the present work. 

However the second subtlety is that one may not, in 
fact, want to optimally reconstruct the full sky correla- 
tion function. If we are interested in using a cut sky 
correlation function not to remove localized contamina- 
tion, but instead as a distinct quantity in its own right, 
the efficiency of the QML estimator at reconstructing 
the full sky becomes a hinderance. Starting instead from 
definition (|8| - the correlation of pixel pairs in a finite 
region of the sky - gives us a transparent interpretation. 
Therefore it remains of interest to examine carefully the 
PCL-derived C{9)™^, not because the QML estimator 
technique is in doubt as a way of extracting reliable full- 
sky information, but because the PCL technique explic- 
itly extracts information which is different. 



III. WHY IS 51/2 IS SMALL? 

In this section, we discuss the well-established result 
that is unexpectedly low (and much smaller than 
the full sky value), and consider the origin of this obser- 
vation from a harmonic space perspective. Known as- 
pects of the full sky realization are found to be behind 
the result, namely (a) the low amplitude, planarity and 
rough Galactic alignment of the quadrupole; and (b) the 
planarity and alignment of the octupole. 

Combining equations ^ and ^ shows that the value 
of 5[y2 derived from pixel-pair averages on the cut sky 



ocut 

'-'1/2 



L-g, Sw 



(10) 



It is not clear why various groups have seen differences 
(albeit minor) in measured numerically on the sky 



and defined by equation (10), but it probably re- 



lates to difficulties in designing stable numerical schemes 
which estimate C{9) directly from pixel pairs. 

To understand the origin of the observed value, let 
us first note that, in a typical realization of the isotropic 
ACDM model, the primary driver of the ratio '5™2/'5|^/2 

is the ratio Cl^^/C^"". This reflects the fact that, sub- 
stituting the theory CiS in equation Q, the dominant 
contribution is from the term quadratic in C2. However, 
due to the low amplitude of the full sky quadrupole in our 
particular realization, the 1 = 2 mode becomes subdom- 
inant in determining iSi/2- C3 then gives the dominant 




FIG. 1: (Color online.) An illustration of the low value of S'^^^ ^^'^ its origin. The upper left-hand panels show the pattern of 
ILC7 temperature fluctuations in ^ = 3, 5 and 7 modes from top downwards; the larger panel underneath shows the sum, in 
which anticorrelations between the modes cause power near the poles to be small. The shaded regions bounded by solid and 
dashed lines represent a 20% azimuthal mask in the Galactic and £ = 3 angular momentum planes respectively (defined in text). 
The large uppermost panel on the right hand side shows the values of for ^ variable width Galactic azimuthal mask (solid 
line). The dashed line shows the same result using QML, rather than PCL, reconstruction techniques; the dotted line shows 
the QML result when the mask is applied in the £ = 3 angular momentum plane, demonstrating that full-sky power can be 
efficiently hidden even from the QML estimator. The two right-hand panels underneath the main plot show the corresponding 
estimated values C2 and C3. Plotted points show the results from using the WMAP team's KQ85y7 and KQ75y7 masks for 
PCL (circles) and QML (crosses) estimators respectively, exhibiting the QML estimator's relative sensitivity to mask shape. 



contribution on the full sky, so that C'|'^'"/C|"'' deter- 
mines the magnitude of S^j2/^if2- 

Now the octupole of the observed reaUzation happens 
to be somewhat planar (although, as quantified below, 
not 'anomalously' so). Its preferred plane is, in turn, 
very roughly aligned with the Galactic plane. When the 
octupole is masked using one of the standard WMAP 
temperature analysis masks such as KQ75y7 (usable sky 
fraction /^ky = 70.6%) or KQ85y7 (/,ky = 78.3%), a 
significant amount of power in the octupole is hidden 
due to this approximate alignment, leading the recovered 
C^''"^ to be an under-estimate of the full sky value. Thus, 
according to the considerations above, drops sharply 
in response. 

Figure [1] illustrates this further by showing (top plot, 
solid line) the value of 5[y2 derived from the seventh- year 
WMAP j22j Internal Linear Combination map (ILC7) as 



the sky coverage of an equatorial, azimuthal mask is in- 
creased from to 40% {i.e. /sky drops from 100% to 
60%). When the sky is unmasked, the PCL and QML 
power spectrum reconstructions reduce to the full-sky es- 
timate, so that all results agree for the nil cut. For the 
moment we will focus on the behaviour of the PCL re- 
constructions, returning to the QML cases (dashed and 
dash-dotted lines) momentarily. 

As described above, the rapid decline in ^ ^ func- 
tion of increasing mask width is largely due to the cor- 
responding decline in Cf~^^ (illustrated in the lowermost 
right-hand panel of Figure [T]), which in turn can be linked 
to the progressive masking of the planar-concentrated 
power (see also the 20% sky-cuts illustrated in the CMB 
projections on the left of the Figure). The known pla- 
narity of the quadrupole is also important, in that the 
cut-sky estimates C'f'"^'" decline with increasing mask 
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FIG. 2: (Color online.) The difference between the cut-sky 
estimates Ci and the fufl-sky measured values Ct on the ILC7 
map, expressed for ease of viewing as a fraction of the fidu- 
cial theoretical best-fit Cf's published by the WMAP team 
PT] . The resuhs from a PCL and QML estimator for a 20% 
azimuthal sky-cut are shown by dots and crosses respectively. 
The shaded bands show the expected deviation of the cut-sky 
from the full-sky values. The larger, outer band represents 
the PCL deviation while the smaller, inner band (with dot- 
ted edges) represents the QML deviation. The low value of 
55^/2 arises because the PCL power spectrum reconstruction 
for I < 8 typically falls below, never significantly above, the 
full sky value. 



area (central right-hand panel) and so do not regain dom- 
inance over the octupole contribution. 

The described properties of multipoles £ < 3 are not 
quite enough, on their own, to account for the low S^^^- 
Working with an azimuthal mask of 20%, if we use the cut 
sky values Cf'~^^ for £ < 3 and the full sky values Ci for 



> 3, we calculate a value of S 



hybrid 

1/2 



3327 ^K*. The 



true cut-sky value for the same mask is — 1529 /iK^. 
We can account for this discrepancy by noting that the 
PCL reconstructions of two other multipoles, £ = 5 and 
£ — 7, are also rather low. 

The overall situation is illustrated in Figure [2] where 
we have plotted (as circles) the difference between full- 
sky and cut-sky power spectra {Cf'^^^—Ce) at /sky = 80%; 
these are calculated from the ILC7 map and scaled by 
the concordance theory C^'s [H]. The differences at 
£ = 3,5,7 are all somewhat outside the la variance (il- 



lustrated by the grey jagged band)^. 

The reason for the shortfall in reconstructed power in 
£ = 5 and 7 is not immediately clear from inspecting 
their individual patterns on the sky (small panels near 
top left of Figure [1}. Only when all the odd multipoles at 
£ < 7 are summed does the power become visually planar 
(see the larger Mollweide projection at the bottom left of 
Figure [1]). Thus cancellations between the £ — 3,5,7 
modes in the polar regions effectively hide power from 
estimators once the sky is masked. (We note that even- 
£ modes have no effect on the odd-^ reconstruction and 
vice versa, since these are decoupled when adopting an 
equatorially symmetric mask.) 



A. Behaviour of the QML estimator 

Having established the origin of the low S'^'J^, let us 
turn to the effect of using a QML, rather than PCL, 
estimator in reconstructing the full sky. It has been 
commented elsewhere [TQ that, for KQ85y7 masks, the 
QML estimator reconstructs most of the power in the full 
sky octupole; S'^J^^ is close to iSj^^j even for the larger 
KQ75y7 mask. We have reproduced these results; QML 
outputs are plotted as crosses in the panels of Figures 
[1] and [2j As discussed above (Section |ll]) , the QML es- 
timator re-weights its input maps to extract full-sky in- 
formation as efficiently as possible; hence the improve- 
ment is not surprising. Recall that, even when consider- 
ing anisotropic theories, the QML estimates for the full 
sky are expected to be superior (this is further reinforced 
in Section IV below). 

However, the QML estimator's ability to make an ef- 
ficient recovery of our full sky results does depend on 
the shape of the mask^. For an azimuthal mask covering 
a sky fraction > 20%, even the QML estimator starts 
to underestimate the power on the full sky (illustrated 
by the dotted lines in the right panels of Figure [T] the 
lower panel shows that the falling S'^J^^ tracks a drop in 

the power of the reconstructed octupole C^^^). These 
results show that, if the power is sufficiently localized 
within the mask, it cannot be reconstructed by any tech- 
nique. 

This interpretation of the results is confirmed by ap- 
plying an azimuthal mask in the plane (as defined below) 
of the octupole (dotted lines in all panels of Figure [T]) . 



The standard deviation illustrated in Figure [2] is defined as 
({Ci — C^)^)^/^, i.e. it is the 'cut-induced' variance introduced in 
Appendix |A 3[ equation | |A15[ |. We should note in passing that 
the variance is close to diagonal - i.e. correlations between the 
estimates for difli'erent £ are small in both PCL and QML cases. 
The PCL estimator is less sensitive to the exact shape of the 
cut than the QML estimator; this is to be expected given the 
simplicity of the former method (which, to a close approxima- 
tion, measures the power on the cut sky and scales it by the 
appropriate sky fraction). 
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The octupole plane is defined by rotating the map un- 
til the "angular momentum dispersion" statistic |23j for 
quantifying the planarity of multipole i is maximized: 



Li 



(11) 



(This maximization is achieved using a fast method de- 
scribed in Appendix[E|) Now the octupole is masked from 
the map very efficiently, and C'^^^ drops sharply as a 
consequence. In response, S'^j^^ becomes a severe under- 
estimate, at large sky cuts becoming even worse than the 
Galactic azimuthal-masked PCL estimator. 



B. Summary of the frequentist result 

The preceding material has shown that the small mea- 
sured value of is attributable to a series of some- 
what unlikely aspects of the observed realization. We 
now recap and discuss briefly the frequentist statistical 
significance of 5™2- 

The primary contribution is the low (254 //K^) 
quadrupole amplitude (with a P-value of 4% given the 
best-fit power spectrum). The planarity of the octupole 
can be assessed by considering the rotation-maximized 
value of Eq. (11), which on the ILC7 map is 0.926. 



The P-value computed from 10,000 isotropic realizations 



for observing > 0.926 is ~ 15%, 



our realiza- 



tion is not particularly unusual. The approximate align- 
ment of this somewhat planar octupole with the Galactic 
cuts typically used in CMB data analyses can reasonably 
be regarded as purely coincidental (P-value 21%). A 
consistent picture is found from assessing Figure [2j in 
which the £ = 3 PCL reconstruction deviates from the 
mean by about — 1.5(7. Similarly the shortfall of PCL- 
reconstructcd power in the i — b and 7 modes is a fluctua- 
tion of around —2a. None of these observations on their 
own look particularly unusual; the statistical anomaly 
arises instead because all of the low-^ PCL estimates are 
low. Despite its suboptimal nature the PCL estimator is 
unbiased, and the reconstructions at different -^'s are only 
weakly correlated, so one would have expected as many 
over- as under-estimates. 

The frequentist significance of the iS™2 result is con- 
nected, then, to a series of coincident minor anomalies in 
our realization. Only when combined in a specific way 
do these observations raise frequentist alarm. Of course, 
this simply shows that we have found a way to 'factor' 
the low P-value of S^^^, which does not by itself deter- 
mine whether the anomaly might point to theories be- 
yond the concordance model. Therefore in the next Sec- 
tion, we consider the feasibility of finding theories which 
are statistically preferred to the concordance theory in a 
Bayesian comparison of 5™2- 



IV. ANISOTROPIC THEORIES 

We have shown in the previous Section that the low 
observed 5^2 can be attributed to the fortuitous align- 
ment of power in the f = 3, 5 and 7 modes of the CMB 
(along with the planarity, and small full-sky amplitude, 
of the quadrupole). 

Broadly, one can imagine three distinct ways in which 
the small observed cut-sky power in the ^ = 3, 5 and 7 
modes could look less anomalous in an alternative theory: 

1. The Cj'^^ estimates could turn out to be biased in 
the ensemble mean of the true theory; 

2. The Cj^^ estimates could have a larger variance 
in the ensemble of the true theory, making the de- 
partures from the mean less significant; 

3. The true theory could correlate Cj''^^ estimates so 
that the likelihood of small cut-sky power in £ = 5, 7 
is greater once the small cut-sky power in ^ = 3 is 
known. 

Our main focus in what follows will be on (1); possibilities 
(2) and (3) will be mentioned where relevant. 

Given a true covariance matrix C related to the con- 
cordance isotropic theory by C = cconc _^ g^^g ^^^^ 
explicitly calculate the bias. 



Biasf ^ (Ce-Ce) 



and the variance 



(12) 



(13) 



where the false covariance matrix C'^°"'^ is used in con- 
structing estimators, but the true covariance matrix C 
is employed in taking the final ensemble average. For 
algebraic expressions the reader is referred to Appendix 
IX3l 

To gain a feel for how alternative theories can influ- 
ence the recovered power spectrum on the cut sky, let us 
consider the following specific cases. 

(i) Galactic contamination, i.e. residual errors in the 
Galactic signal subtraction. This is modelled by 
creating a template map of possible errors, taking 
1% of the difference of the WMAP7 ILC map with 
the V-band map (after smoothing to a common res- 
olution of 1°). The map gives us a rough handle 
on the form of the residual contamination to be 
expected (albeit with an unknown amplitude). In 
the ensemble, the template map is simply added to 
the observed CMB sky, yielding equivalent results 
to a theory with anisotropic Gaussian correction 
A = gg^^, where g represents the spherical har- 
monic coefficients of the contamination map. It 
has already been shown in Ref. [21] that this kind 
of contamination cannot improve the likelihood of 
iS™2: but the model remains helpful for our discus- 
sion below. 
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FIG. 3: (Color online.) For various anisotropic theories we plot the biases (left panels) and change in the diagonal part of the 
variance (right panels) compared to the isotropic case. In this context non-zero 'bias' can be desirable, for instance in the case 
illustrated in the uppermost panels, where it merely reflects that the full sky power spectrum is contaminated by the presence 
of the galaxy. The solid lines show statistics for PCL reconstructions of a /sky ~ 20% azimuthally masked sky; the dashed lines 
show the corresponding QML reconstructions. As expected, the QML reconstructions are generally more accurate (despite the 
anisotropy of the underlying theory); for more details see text. We scale the biases by the isotropic theory power spectrum 
and the change in the variances by the isotropic cosmic variance. The first three theories are simple existing models while the 
latter two are tests which are designed specially to reproduce the 3'^% anomaly. Note the different scales on each panel. 



(ii) Bianchi Vllh template. Using the algorithms 
of Refs. [551 US] we calculate a temperature 
anisotropy template for the Bianchi VII/i vector 
mode case with an amplitude of 35 /iK according 
to the best-fit parameters of Ref. j27j. In the en- 
semble this is added to the concordance CMB as 
with the templates considered above. Physically, 
such a setup can be motivated by the existence of 
anisotropic Bianchi modes which are well behaved 
at the initial singularity, although such models are 
fine-tuned. 

(iii) Quadrupolar Modulation. A strong quadrupolar 
modulation^ of the temperature field is known to 



reproduce the co-planarity of quadrupole and oc- 
tupole [5D]. The modulation is required to have a 
very large amplitude, yet be confined to low mul- 
tipoles. Dvorkin et al. [H] discuss how any early 
universe model of such a modulation must be care- 
fully tuned in harmonic space to avoid the leak- 
age of modulated power to high multipoles through 
projection effects. We approximate these consider- 
ations by modulating only the quadrupole and oc- 
tupole of isotropic realizations. Since a quadrupo- 
lar modulation of a multipole I couples power to 
£ ± 2, on the full sky our modulation only has an 
effect on multipoles I <b. 



Attention has also been given in the past to dipolar modulations. 
In the case of an equatorial azimuthal mask this can have only 
second order effects on power spectrum reconstruction, since it 
couples ^ to £ it 1 while the mask couples £ to ii: 2n. A different 



quadrupolar anisotropy, that of the inferred primordial power 
spectrum, has also been reported 1121 1131 l28l . However, Hanson 
et al. 1291 have identified WMAP beam asymmetries as the origin 
of this unconnected effect. 
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(iv) Picture and Designer theories. These are specific 
theories designed to investigate the best possible 
statistical gains to be made from anisotropic the- 
ories over the concordance case. We will describe 
them in detail in Section p^V B below. 

The left panel of Figure |3] exhibits the biases induced 
by each of these theories [defined by equation (12|]; the 
right hand panel shows the diagonal part of the variance 
[defined by equation (13)]. In both panels, the results 
from the PCL estimators are plotted as solid lines, while 
the QML results are shown by dashed lines. The sky 
cut imposed for these calculations is a 20% Galactic az- 
imuthal mask. 

In the first case, that of Galactic contamination, the 
'bias' reflects the added power from the Galaxy, visible in 
the full sky C^'s but naturally invisible to reconstructions 
made from the cut sky (in which the Galaxy is masked 
away). Both the PCL and QML estimators therefore 
become equally 'biased', but this is, in fact, a desirable 
feature: they are rejecting the contamination. Note that 
the zig-zag pattern in the bias arises from the rough equa- 
torial symmetry of the Galaxy, which results in a much 
stronger coupling to even, rather than odd, ^'s. On the 
other hand the zig-zag in the biases actually observed 
(Figure [2| is larger at odd, rather than even, ts. Fi- 
nally, the apparent biases and extra variance of spatially 
localized contamination tends to grow towards high £ as 
a fraction of the full sky power, whereas the observed 
discrepancies are confined to low £. 

The second case (Bianchi contamination) is similar in 
that it adds a template to the concordance covariance; 
but because the power is not localized within the mask, it 
is now visible even on the cut-sky. As expected from our 
earlier considerations, the QML estimator in this regime 
reconstructs the full sky Ci power with a smaller bias and 
variance than the PCL case. The increased variance (of 
order 20% of the cosmic variance) is more significant than 
the bias (of order 2% of the power spectrum). This can be 
understood by noting that, since the Bianchi signal has a 
small rms power of ^ 12fiK^, the individual elements of 
the template covariance contribution A are much smaller 
than the elements of the concordance covariance matrix 
Qconc^ Expanding an expression for the variance (A14) 
highlights the existence of cross-terms in C'^°"'^ and A; it 
is these leading order contributions which give the larger 
variance. 

Let us now turn away from additive modifications 
to the concordance theory, and instead discuss the 
quadrupolar modulation. When analysed on the cut sky, 
the power at low multipoles is hidden from the PCL es- 
timator (but less so from the QML estimator), leading 
to a negative bias. The modulation also couples £ zt 2, 
creating power on the full sky in ^ = 4, 5; this accounts 
for the spike at these multipoles in the variance of the 
estimators. The extra power is further spread to higher 
£ by mode-coupling resulting from the masking opera- 
tion. The result is that the PCL estimator over-estimates 
power in multipoles 5 < ^ < 15; note that, because the 



power spectrum is rapidly decreasing, a small leakage of 
power to high £ from the low multipoles can result in a 
substantial bias. Once again the QML estimator fares 
better, more efficiently confining the contamination to 
low £. 

The quadrupolar modulation behaves qualitatively as 
expected, hiding power at low £; this is the right sort of 
effect to reproduce the low Sy^ ^-^d therefore produces 
a small increase in the likelihood of the observed value. 
However, Figure |3] shows that the biases from this theory 
are rather small. Therefore, rather than focus on this 
model, we can go one stage further and consider tuning 
Gaussian models to fit the value of S'^'J^ as closely as pos- 
sible. These theories are less transparent in their physical 
meaning, but are guaranteed to give a better fit to the 
observed properties of the sky. 



A. Picture theory 

Consider the theory which exactly matches the ob- 
served CMB; it has covariance matrix C = aa^ where 
a represents the observed ILC a^m's. Because C has 
zero variance in any direction orthogonal to the observed 
data, it has an infinite likelihood (or, more correctly, a 
likelihood bounded from above only by noise in the ex- 
periment). 

The ensemble for this theory is a series of pictures of 
our own CMB sky (represented here by the ILC map), 
scaled by a Gaussian random amplitude of unit vari- 
ance. Consequently the biases exactly match the values 
of Ci — Ci for our observed sky (see Figure [3|. However, 
because there is only one mode (the amplitude scaling of 
the entire sky) , the variances become extremely large. To 
build intuition, consider the cosmic variance of the con- 
cordance model, in which the variance on Ci decreases as 
2£+l. This arises solely because of the additional modes 
available at increasing £; in the picture theory all modes 
are perfectly correlated, so the cosmic variance does not 
decline in this way. 

In spite of the divergently large likelihood for the pic- 
ture theory, the variance means that our observed value 
of iSJ^"2 has a finite likelihood which can be calculated by 
Monte Carlo simulation of the ensemble. In Figure |4] we 
plot the log likelihood for the concordance ACDM model 
(solid line) , the picture theory (dash-dotted line) and the 
designer theory (dashed line), the last of which we will 
return to momentarily. 

The improvement in the log likelihood of for the 
picture model (over the isotropic concordance case) is 
Aln>C = 3.7. This disappointingly modest improvement 
can be seen to result from the large cosmic variance in a 
theory with only one degree of freedom: while it peaks 
near the observed value, the S^^^ likelihood function for 
the picture theory is extremely broad. The broadness in 
turn impacts upon the peak value because the likelihood 
must be normalized to one. 
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pendix O . The freedom is truncated at i„ 



10; we 
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FIG. 4: (Color online.) The likelihoods for Sl'J\ compared be- 
tween three theories: the isotropic concordance theory (solid 
line), the 'designer' theory (dashed line) and 'picture' (dash- 
dotted line) theory. The latter two are specifically designed 
to reproduce low S^^^. The improvement in the log likelihood 
over the concordance cases are respectively 4.2 and 3.7, which 
are very small improvements given the fine-tuning involved. 
Since these theories should produce the greatest possible gains 
in likelihood, the value of the observed iS^yj statistic is not a 
strong objection to the concordance theory. 



B. Designer theory 

We have examined the picture theory in which the 
likelihood of our own observed CMB is, by design, di- 
vergently large. The hkelihood was shown to barely 
favour the theory because the variance on this value is so 
large. In this section we search for a theory with simi- 
lar properties to the observed sky, but allow power to be 
spread through many more modes, so that the variance 
in high-order statistics such as is better controlled. 

A full explanation of the mathematical construction is 
given in Appendix [Cj Our search is over all positive defi- 
nite covariance matrices C, corresponding to all Gaussian 
theories with zero mean ((AT) 0), subject to two sets 
of constraints. Firstly, the covariance matrix is required 
to have full-sky theory C^'s [defined by Eq. ([3|] equal to 
the observed ILC values (Q = Cf^^). Secondly, the PCL 
estimator applied to the theory on the 20% azimuthally 
masked sky is required to find zero power in ^ = 2, 3, 5 
and 7 [i.e. BiaSf = — at these £'s). In order to satisfy 
these results simultaneously, our technique naturally in- 
troduces anisotropic correlations between different mul- 
tipoles. Substantial freedom remains, which we use to 
minimize the cosmic variance of the final theory (see Ap- 



adopted the concordance covariance for all £ > -^max, but 
verified that our conclusions are insensitive to this choice. 

The CMB projection labelled 'Designer' in Figure [3] il- 
lustrates an actual realization from this model (although 
only the £ = 3, 5 and 7 modes are plotted). One can 
see that the theory is very efficient at localizing power 
in modes confined within our specified 20% mask (while 
keeping the full sky power spectrum to the specified val- 
ues). The bias panel for this theory shows that, accord- 
ingly, no power is detected by the cut-sky PCL estimator 
in £ = 3, 5 and 7. (The plot shows that some of the bi- 
ases are actually smaller than — C^, which is as expected 
since C]'^^ > at the corresponding multipoles.) At low 

the variance on the C^s is smaller than for the con- 
cordance model {6Vu < 0), because the reconstructed 
power in these modes remains close to zero in all real- 
izations. For i > 10, there is a spike of larger variance 
arising from the mask-induced contamination similar to 
that described for the modulation model. At large £, the 
variance tends to the standard concordance variance for 
the estimators {5Vu = 0). Once again, the QML estima- 
tor performs better in minimizing both bias and variance 
compared to the PCL case at nearly all i. 

We can now return to Figure [4] which displays, as 
a dashed curve, the likelihoods for our designer the- 
ory. We described above how the power localization 
at low £ favours a low S^J^j accordingly the peak like- 
lihood (at logio '^1/2//^^^ ~ •^■^) considerably smaller 
than the equivalent value for the concordance theory 
{iogio '^1/2/^^^ ~ ^■^)- However, despite being mini- 
mized by spreading power through more degrees of free- 
dom, the variance of the designer remains large and 
consequently the improvement in likelihood is modest 
(Aln£ = 4.2) despite the dramatic increase in the num- 
ber of degrees of freedom needed to construct this theory. 

It is clear from Figure|4]that to obtain significant gains 
in likelihood for 5™2j o^i^ needs to achieve far smaller 
cosmic variance on 5™2- But the designer theory plau- 
sibly gives near the smallest possible variance on this 
quantity. In particular. Appendix [C] derives a minimum 
bound on the variance of iS™2- The lower bound can 
be understood as arising from a suitable isotropic limit 
(which is unattainable in practice, but provides a prov- 
able lower limit for attainable theories); isotropic theo- 
ries minimize the cosmic variance for a given power spec- 
trum, because they maximize the number of independent 
modes and spread power through these modes as evenly 
as possible. The lower bound calculated from the appro- 
priate isotropic test-case is cr^jini^i%) — ^■'^ ^ lO^^K^, 
compared with the variance on the designer theory of 
(j'^{Slf^) = 2.9 X lO^/uK*. Thus the designer theory de- 
tailed in this section almost saturates the variance limit; 
we may be confident that no Gaussian theory can have a 
significantly more peaked likelihood. 
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C. Summary and discussion 

The two theories we have discussed in the preced- 
ing two sections (the first giving an infinite likelihood 
for the observed sky; the second tuned as far as we 
can to give a large likelihood for the single value Sy^) 
strongly suggest that no anisotropic Gaussian theory can 
improve the likelihood of the observed S'^^^ by rnore than 
A In £ ~ 5. Because of the careful fine-tuning of these 
models, they form a plausible upper bound for the sta- 
tistical gain available. 

How can we interpret this very modest likelihood gain? 
From a Bayesian perspective, correlations between the 
primary temperature (from the high- 2: last scattering sur- 
face) and the integrated Sachs- Wolfe signal (from local 
structure) are implied in any model aligning low-£ power 
[TBI l3H 132] . Therefore any realistic physical prior prob- 
ability is very small relative to the isotropic ACDM cos- 
mogony; the posterior probability ratios will still vastly 
favour the latter theory. 

An alternative argument is as follows. Up to -^max = 10 
(and excluding monopole and dipole), the anisotropic 
theory has approximately 6 900 degrees of freedom com- 
pared to the isotropic case with 8 degrees of freedom. 
Thus the improvement in log likelihood per degree of free- 
dom is of order 10^'^. While this is not a strictly Bayesian 
interpretation, it does suggest that our statistical gain 
has been achieved only with enormous fine-tuning. 

Both of these lines of reasoning suggest strongly that 
the observed value of can never constitute strong 
evidence in favour of any Gaussian theory posited as an 
alternative to ACDM. One escape route from this result 
is to consider, rather than the absolute value of 5™2' 
the ratio 5^2/^1/2- In the case of the picture theory, 
for instance, one then has an infinite likelihood for the 
observed value of the ratio (because the one available ran- 
dom degree of freedom - the amplitude - cancels between 
numerator and denominator). This immediately demon- 
strates that there is no upper bound to the likelihood 
gain for such a statistic. We would suggest, however, 
that the existence of an upper bound for the original 
statistic, iS™2 7 an attractive property - precisely be- 
cause it allows for an understanding of the Bayesian the- 
oretical improvements available without detailed physi- 
cal modelling. The most convincing way to show that 
the observed sky is anomalous would therefore be to find 
a statistic encapsulating the planarity and correlation of 
power which has large, but not trivially infinite, likeli- 
hood gains available. Starting from the results of Section 
such a statistic might be developed from physical con- 
siderations on the full sky. 



V. CONCLUSIONS 

There is a classic difficulty in understanding large and 
complex datasets such as those produced by WMAP 
and, in the future, Planck: they contain so much in- 



formation that statistical anomalies can be found with- 
out any difficulty. We have taken as an example the 
purported anomalous aspects of the angular correlation 
function. Some previous work claims that, after consid- 
ering these anomalies, the entire cosmological paradigm 
is to be doubted [lOj : other authors claim that apparent 
anomalies can be dismissed as the product of a posteriori 
analysis |16| . Yet a posteriori reasoning must be allowed 
in science, since otherwise we would rarely, if ever, rec- 
ognize failings of our existing knowledge. 

The contrary statistical claims relating to S'J^j are rec- 
onciled by appreciating that, without an alternative the- 
ory to test against, there is no unambiguous significance 
to any anomaly. We have therefore presented an alterna- 
tive approach to this puzzle: we examined the origin of 
the low S'™2 in harmonic space, and then attempted to 
find theories that reproduce the required patterns. 

In the process we noted that the cut-sky correla- 
tion function contains identical information to the PGL 
power spectrum estimates. We therefore used the PGL 
estimates for the majority of our results, but also 
demonstrated that the standard QML techniques pro- 
vide more reliable reconstructions of the full sky, even 
when anisotropy is suspected. We informed our intu- 
ition about the behaviour of the estimators by consider- 
ing simple anisotropic modifications to the concordance 
models (contamination, Bianchi and quadrupolar mod- 
ulation theories). This showed explicitly that the QML 
estimator biases introduced by anisotropic theories were 
smaller than or comparable to the PGL case. 

Then, by attempting to construct anisotropic Gaussian 
theories which improve the likelihood of the low Sy^^ 
demonstrated that no significant gains in likelihood for 
this single statistic are available. Since there is no sug- 
gestion in the observed sky that the underlying ensemble 
is significantly non-Gaussian |33| . it is implausible that 
post-Gaussian corrections would substantially change our 
results. We therefore conclude that the anomaly is 
not likely to point to new physics. 

If it does have any meaning, the S'^'J^ anomaly (and the 
underlying shortfall of power seen by PGL estimators) 
does not indicate a vanishing large-scale correlation func- 
tion, but rather is related to alignments of low-£ power on 
the full sky (Section III I. It is likely that full-sky statis- 



tics can be constructed which capture these unexpected 
correlations better than ~ ^^d these could evade our 
likelihood limits. However, we argued that more trivial 
modifications (such as taking the ratio 8^2/ '^1/2) which 
sidestep our constraint by attaining an infinite likelihood 
under the 'picture' theory (C = aa^^) are not helpful; 



see Section IV G In other words it is highly desirable to 



choose statistics, such as 5™2i that do allow for a finite 
limit to be placed on the Bayesian statistical gain avail- 
able under a wide class of alternative straw-man models. 
Gonsidering the magnitude of that limit is then, in our 
view, a plausible way to probe the significance of a pos- 
teriori anomalies. 
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Appendix A: Quadratic Estimators: some useful 
results 



In this Appendix, we summarize some technical details 
omitted from the main paper alongside useful results per- 
taining to the two most common quadratic power spec- 
trum estimators: the pseudo-C^ and quadratic maximum 
likelihood estimators. These will be introduced and com- 
pared in a single quadratic estimator framework to gain 
insights into their similarities and differences. Since we 
work in harmonic space, we first explain the sky-masking 
operation. 



1. Cutting the sky 

It is standard practice in CMB analysis to remove re- 
gions of the sky in which contamination from the Galaxy 
(or other undesirable sources) is suspected. This is ac- 
complished by masking the temperature field and then 
constructing measurements based solely on the masked 
data. In harmonic space, the masked temperature ex- 
pansion coefficients aim are related to the unmasked aprn 



12 



where 



(Al) 



K.^tm' = J dnY;„xn)Yi,„A.^)Mm- (A2) 

Here M{n) is within the masked region and 1 outside. 
[See Appendix|D]for a brief discussion of a hidden numer- 
ical pitfall in equation (A2|.] We can write expression 



(Al) compactly as the linear transformation 



Ka, 



(A3) 

where a is a vector composed of the a^m's. 

It will be helpful to note that K is both idempotcnt 
(K^ = K) and Hermitian (K^ = K). These identi- 
ties may both be derived straight-forwardly from equa- 
tion ( A2 ) ; together they allow much flexibility in ma- 
nipulating certain equations. In these appendices, the 
addition of a tilde will represent masked quantities and 
operators; thus b — Kb for any data vector b, while for 
any matrix M we write 



M = KMK. 



(A4) 



implicitly taking advantage of the Hermitian property. 
For most of our numerical results we have assumed the 
mask is aziniuthally symmetric, M{9, (j>) = M{(p). This is 
a reasonable approximation to true Galactic masks, and 
results in enormous computational simplification because 
K becomes sparse. 



Ku'Smm' (no sum). 



(A5) 



However all algebraic results are obtained with no such 
assumptions and are applicable to any type of mask. 

Since the estimators considered here are quadratic in 
the cut-sky film's, we may write for a generic estimate 



Cf = a^R^a 



(A6) 



for some set of matrices R^. Before explicitly defining 
these matrices, we describe a helpful notational trick and 
discuss a couple of generic features of quadratic estima- 
tors. 



2. A helpful notational trick 

Recall that in Sec. |ll]the power spectrum Ci observed 
in our single realization of the full sky was defined as 



2i 



-El 



(A7) 



and the theoretical power spectrum Ci was taken to be 
the expectation value of Eq. (A7), Ci = (Ce). We will 



henceforth use a shorthand for such expressions, writing 



C,= 



2£ + 1 ' 
Tr CA^ 
2i+l 



(A8) 
(A9) 



where C = (aa^) is the theory covariance matrix and the 
elements of the matrices are 



(no sum). (AlO) 



Thus is the projection operator into the spin-£ sub- 
space. The following two properties of are useful: 



Tr A' = (2£-f 1), 



bggi^^ (no sum). 



(All) 
(A12) 



Introducing the set of matrices A^ produces considerably 
more compact and readable equations at later stages. 



3. Expectation and variances 



Given the cut sky power spectrum estimates Gi defined 



by equation ( A6 ) , we have respectively 



(Cf) = Tr CR^ 



(A13) 

2TrCR^CR^' (AM) 



(aat) 



for the expectation and variance, where C 
KCK is the cut-sky harmonic covariance matrix. 

The estimator variance Vw characterizes the random 
error associated with estimating the ensemble quantity 
Ci from a single masked realization. This is the appro- 
priate quantity for most results in the paper and appen- 
dices. However, occasionally one wants a measure of the 
extent to which the cut sky estimators accurately predict 
the full sky (rather than ensemble averaged) power. A 
suitable quantification is given by the following, which 
might be termed the 'cut-induced variance' (since it is 
necessarily zero on the full sky): 



CIV«- EE 



Cf/ — Cf 



where Z = R/. 



21+1 



= 2 Tr CZ^CZ^ 
(A15) 

(A16) 



The diagonal part of the cut-induced variance for a 20% 
azimuthal sky cut is plotted as a band in Figure [2j It 
may be verified that CIV^^ ' ^ Vu> ; expanding expression 



( A15 1 shows that the cut-induced variance is equal to the 



sum of the cut-sky and full-sky cosmic variances minus a 
unique cross-term. 



4. The reconstruction matrices 

Let us now turn to specific reconstruction methods. 
The R^ matrices for the PCL case {e.g. Ref. [18]) read 



^PCL 

with MfJP^ 



^(Af-i)F,9LA^7(2£' + l), (A17) 

e' 

Tr A^A^7(2^' + 1): (A18) 
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where the 2£' + 1 normahzation on each of these expres- 
sions is conventional. It may readily be verified that 
these form unbiased estimates for the full-sky, ensemble- 
averaged CiS in an exactly isotropic theory since the 
covariance matrix C may be written as 



=3 estimators, 20% azimuthal mask 



(A19) 



In this isotropic case one can write the covariance matrix 
on the cut sky 



C = ^ A' 



(A20) 



showing that our harmonic-space plays the role of 
in the notation of Tegmark's pixel-space exposition of the 
QML estimator [19]. Accordingly, the QML reconstruc- 
tion matrices are written'': 



QML 



C-^A^ 



where mT^^ = Tr Q-^^'C-^K' 



(A21) 
(A22) 



In these expressions it is possible to substitute for C 
a false covariance matrix C'^""'^ which differs from the 



true theory matrix used in expressions (A13), (A14| and 
(A15). This represents the state of affairs when an incor- 



rect assumption is made by an analyst about the isotropy 
(or some other aspect) of the underlying theory, as sim- 
ulated in Sec. |IV| above and Appendix |A 5| below. In nu- 
merical construction of the QML estimators we assumed 
a variance on the monopole and dipole of lOOO/iif^. This 
effectively projects out information which is contami- 
nated by cross-talk from the monopole and dipole, and is 
likely to be over-cautious, but residual foregrounds make 
it hard to quantify the uncertainty in the WMAP zeroing 
of these quantities. (See also the discussion in Ref. [TO].) 
We verified the results were not sensitive to the precise 
variance assumed on £ = 0, 1. 

As for the PCL case, the QML estimates are unbi- 
ased {{Cf^^) = Ce) if both C and C^™'^ are isotropic. 
If C = cconc i\yQy are also optimal in the sense that 
no unbiased estimator (quadratic or otherwise) can start 
from the cut-sky a^m's and produce Ci estimates with a 
smaller covariance ellipsoid pS]- 

However the QML estimator has sometimes been criti- 
cized for the dependence of its optimality on the assumed 
covariance matrix - it appears to rely on the structure of 
the assumed underlying theory in a way that the PCL es- 
timator does not. [No C matrices appear in expressions 



' In equation and below we adopt the convention of as- 

suming tile existence of an inverse for tlie singular matrix C. 
Practically speaking one can regularize the matrix using an ad- 
ditive numerical noise term, or simply use the pseudo-inverse, 
since always appears conjugated by K, the null directions 

of which lead to the uninvertibility. 
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FIG. 5: (Color online.) The weighting which different esti- 
mators give to full-sky modes of differing m in estimating the 
Ce's (here illustrated for £ — 3 with a 20% azimuthal mask). 
The full sky estimator ( thin solid line) by definition weights 
each m equally; see Eq. ( |A7[ ). Masking the sky then using the 
QML estimator (dashed line) comes close to reproducing this 
weighting, despite the loss of information associated with the 
first operation. The action of masking the sky then estimating 
1 = 3 power using the PCL technique (thick solid line) favours 
\m\ = 2 and m — while downweighting \m\ — 3 modes, thus 
giving a less reliable estimate of the full sky power. Similar 
trends are seen at other £. 



(A17) and (A18|.] When the covariance matrix assumed 



may be incorrect, is it safer to use the PCL estimator? 
The answer is 'no'; in fact the anisotropy-induced errors 
in QML estimates are typically smaller than those in PCL 
estimates. We now explain why this should be the case. 



5. The relationship between 
QML and PCL estimators 

By examining the relationship between QML and PCL 
estimators, it is possible to show that QML estimators 
(derived on the assumption of isotropy) are statistically 
superior even if the underlying theory breaks isotropy in 
an unknown way. In outline, the QML estimator can be 
understood as minimizing the cross-talk from variance 
in neighbouring i modes. It can only do this by having 
prior information about the shape of the spectrum in the 
region of the £ estimate under construction. But since the 
observed sky - regardless of its isotropy - has a power 
spectrum with a very similar shape to the theoretical 
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model, the QML estimator is expected to be superior to 
the PCL estimator under any model compatible with the 
observed sky. 

This argument does rely on the QML estimator not 
placing undue weight at given i on any particular m- 
mode. The full-sky estimator by definition gives even 
weight to each m [see expression (A7|]. To accu- 



rately reproduce power spectra, cut-sky estimators must 
trade off equal weighting of the m modes against down- 
weighting modes which are particularly contaminated by 
mask mode-coupling. Derived on the assumption of an 
isotropic theory, it is not clear whether the QML estima- 
tor will do a better or worse job than the PCL estimator 
in this limited sense. A calculation shows, however, that 
the QML is superior - it actually weights the full-sky 
m-modes more evenly than the PCL estimator. This is 
illustrated in Figure [5j which shows the weight given to 
each m mode on the full sky under the composite opera- 
tion of masking-then-estimating. The weights are flatter 
for the QML estimator (dashed line) than for the PCL 
estimator (thick solid line). It is actually a fortuitous 
result of the shape of the concordance power spectrum 
that this is true; otherwise the reliability of the QML es- 
timator would depend more sensitively on the underlying 
anisotropic theory. 

We now demonstrate the crucial result that the PCL 
and QML estimators become identical for a flat power 
spectrum. The covariance matrix is then proportional to 
the identity, C — al, so that the QML reconstruction 
matrices reduce to 



QML 



-2 M7^} A^' = a-^ M^^^KA^'k (A23) 
M"^^^ = Tr A^A^' = a'^ Tr A^A^' (A24) 

where the final expression for M^'^'" is obtained by ex- 
panding the masked expressions (A^ = KA^K) and us- 
ing the condition K.^ 



Al 



K ob tained in Section 
By rewriting equation (A6| as = a'^'KR^Ka, it fol- 
lows that the R^ appearing in any statistical expression 
must arise in the combination KR^K. This means that 
one can, without loss of generality, dispense with the ex- 
plicit masking K matrices in expression ( A23 ) , again re- 
lying on the identity — K. Finally, the factors in 



equations (A23I and (A24| may be mutually cancelled. 



since M appears only in the expression for R. Thus we 
may write 



r; 



QML 



where 



m7;a^' 

Tr A^A^ 



"-PCL 



(A25) 
(A26) 



where the ~ symbol should be read as 'yields identical 
estimates to' ~ i.e. it denotes an equivalence relation, not 
an approximate equality of the matrix elements. To ver- 
ify this, compare the above with Eqs (A17 A18), noting 
that the missing factors of 2£' + 1 are conventional nor- 
malizations which exactly cancel between the two lines. 

This demonstrates that, if C = al, QML estimates 
are identical to PCL estimates. The result does not rely 



on any assumptions about the Galactic cut being small. 
However, for a small Galactic cut, the mask operation K 
acquires a narrow banded structure at high-£ such that 
each £ is effectively coupled only to a finite range of £' 
from £—A£ to £+A£. Thus, even though the concordance 
covariance matrix is not proportional to the identity, at 
high £ its relevant, local structure can be adequately ap- 
proximated as such. This demonstrates the equivalence 
of the QML and PCL estimators in this regime. 

To understand the difference between QML and PCL 
estimators one can reverse the argu ment above [i.e. 
one replaces A^ by A^ in equation (A17) then com- 



pares with equation (A21), finding the latter simply pre- 
weights the data]. 

It follows from all this that the QML estimator can, 
roughly speaking, be expected to remain superior to the 
PCL estimator for any theory compatible with our sky. 
To demonstrate this explicitly, we draw random covari- 
ance matrices with power spectrum equal to that of the 
observed ILC, but taking a random distribution of power 
between different m modes. Explicit calculations for each 
of these theories show that the QML biases (A13I and 



variances (A14) are significantly smaller than their PCL 
counterparts. A specific illustration is given in Figure [6j 
where we plot a histogram of the biases on Cf'~'^ (solid 
line) and C^^^ (dashed line) for 200 000 random the- 
ories. The PCL estimator has a significantly broader 
distribution of biases than the QML estimator, showing 
that the QML technique typically produces more reliable 
estimates of the full sky power. 

In conclusion, for any anisotropic theory which is com- 
patible with our observed sky, estimates for the power 
spectrum formed using the QML technique (despite be- 
ing derived assuming isotropy) are expected to be supe- 
rior to PCL estimates for the same quantity. 



Appendix B: Estimators for C{9) and 5*1/2 

In this section we demonstrate that the pixel-based 
cut-sky correlation function, 

, _ /dnidn2M(ni)M(n2)r(ni)T(n2) 
^ ' ~ /dnidn2M(ni)M(n2)(5e ' ^ ' 

where Sg — 6{ni-h2 — cosd), is identical to the PCL-based 
estimator' 



8 



C(0)PCL = i_ J2^2£ + l)C'PCLp,(cos( 



(B2) 



This has been stated before [T7] and is implicit in other 
works |351 136j . but an explicit demonstration has not, to 



* We are grateful to A. Challinor for initially drawing our attention 
to this equivalence. 
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where the matrix 
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FIG. 6: (Color online.) An example of how the QML es- 
timator typically remains superior to the PCL estimator is 
given by comparing, for 200 000 random anisotropic theories, 
the bias on the cut-sky power spectrum estimates (here illus- 
trated for ^ = 5 with a 20% azimuthal mask). The width of 
the curves show that, for a given theory, the QML estimator 
(dashed curve) is typically significantly less biased than the 
PCL estimator (solid curve). Similar results hold at other ^'s 
and for the extra variance induced by the anisotropy. 



our knowledge, appeared in the literature. The argument 
holds for any weighting function M(n): if M takes values 
other than and 1 t he h armonic-space matrix K [still 
defined by equation (A2)] is no longer idempotent, but 



no part of the proof below is affected by such a change. 
Equation (Bl) may be expressed 



1 



E 



— ^C,P,(cos0), (B3) 
47r 



where F(Q)^ equal to the denominator of (Bl), is a nor- 



malizing function dependent only on M(n). Here Ci is 
the power spectrum of the masked (or weighted) sky, 



atA^a 



n,.rPCL/^PCL 

1^1 II III Kjfi , 



(B4) 



where M^^'" is defined by equation ( AI8). A power spec- 
trum can be calculated from C{9Y : 



Cr' = 27r / C(6')™*P<>(cos6')dcos6i 

PCLAPCL 



iCgi = Tu'lvL,p C 



(B5) 
(B6) 



Tffi — 



2f + 1 Pi{cose)Pii{cosl 

m 



'dcos6' (B7) 



depends only on F{9) (and hence only on the sky cut, 
not any aspects of the theory or realization) . 

If we temporarily consider a theory which is isotropic, 
we have (C(6i)™*) = C{e) = (C(6')PCL) ^^^^^^ 

linearity, ( C*™*) = (Cf^^). But then, comparing with 
expression ( |B6| , the only possibility is that the matrix 
Tggi is the inverse of the matrix Mfp^ - in other words 
that 



^cut 



(B8) 



We reiterate that neither T nor M^*-''" depend on either 
the underlying theory nor the particular realization in 
hand, and therefore this result is independent of isotropy. 
Finally, one inverts the Legendre transform to gain the 
desired result, 



(B9) 



valid for any theory. It follows immediately that 
derived from expression ( 10 1 must be mathematically 



equivalent to pixel-based estimates. 

The above proof depends on the invertibility of Mf/P^. 
It is well known that this matrix is not invertible for all 
choices of sky-cut (although for all masks considered in 
the present work we have found it to be well-behaved). 
However, in any limit where det M^'-^'^ — > ■ 0, one must 
have detT — > 00. According to definition (B7), this will 
occur if and only if F(9) — > for some 9 - in other words 
if and only if the cut sky contains, in the limit, no two 
points separated by certain values of 9. It follows that, 
whenever the entire correlation function can be recovered 
from the cut sky, the PCL estimates can be made and the 
relationship proved above holds. 



Aside: biased high 

We should note in passing that, because S1/2 is 
quadratic in the Cis, its expectation value does not fol- 
low simply by replacing the C^'s with the C^'s in Eq. 
([6]); rather, the full sky expectation value reads for the 
concordance theory 



Sl/2 = {S1/2) = E ( '-^^'^i + 



2C^ Sffi 



(BIO) 



The second term in Eq. (BIO) contributes very signifi- 



cantly to the expectation value, which breaks down term- 
by-term as Si/2 = (4.9 + 3.7) x W = 8.6 x 10* ^K* for 
the WMAP5 best fit 's. This means that (for instance) 
the comparison of our full sky with the theory values in 
Table 1 of Ref. [TD] is not strictly appropriate; with the 
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cosmic variance term included, the observed S1/2 statis- 
tic is made to look even more discrepant with the theory. 
The expectation value of calculated directly from 

the cut sky Ci is also increased, for similar reasons, but 
by a larger amount corresponding to the larger variance 
on the cut sky power spectrum estimates: 



2Cjt 



(2£+l) 



(Bll) 



where Vw is the variance of the 's, given by expression 
(A14). This biases cut-sky S1/2 values to be higher than 
their full-sky counterparts, for instance by ^ 8300 /iK* 
for PCL and ^ 1100 /iK** for QML reconstructions with 
a 20% azimuthal sky cut. 

At face value, such biases make it more surprising that 
the measured 1S1/2 should be so small and S'^^ even 
smaller. However, the standard deviation of 5^/2 very 
large 10^ A*K^) so that the biases do not have a sig- 
nificant impact on the frequentist significances. Further- 
more, and regardless of the magnitude of the biasing, 
the Monte Carlo techniques used in Refs [10] and [16] 
are anyway valid (they automatically take the biasing 
into account). We have therefore included the discussion 
above only for pedagogical interest. 



Appendix C: Designer theory 



In Section IV Bj we used a theory with covariance ma- 
trix C determined by two considerations: 

(i) The full sky power spectrum is given by Cg — Cf^'^ , 
where Cf"'^ is the observed power spectrum on the 
full-sky ILC map; 

(ii) The cut sky power spectrum (PCL estimator using 
a 20% azimuthal cut) is biased, i.e. its expectation 
value does not equal the full sky power; instead we 
set {Cf^^) = C™*. To reproduce the causes of our 
own sky's low S^f^, we set C^™' = for £ = 2, 3, 5 

and 7; at all other £, C^™* = Cj^^. 

Both constraints are linear in the full sky covariance 
matrix [see Eqs. (A9) and (A13l]. While one can 



construct a matrix C satisfying these constraints using 
straight-forward linear algebra, the result is not unique 
and furthermore it is hard to enforce that C be positive 
definite (as it must be to define a valid covariance ma- 
trix). Therefore we adopted the package CVXOPT^ to 
find a suitable theory C within the set of positive defi- 
nite matrices. CVXOPT allows us to find a unique solu- 
tion by minimizing any convex quadratic form, for which 



|http : //abel . ee . ucla . edu/cvxopt/ this package performs con- 
vex optimization within a cone. (The space of positive definite 
matrices is an example of a cone in this sense.) 



we chose the function Tr C^. The choice at first appears 
arbitrary; but schematically, by considering the eigenval- 
ues of C, one can imagine that minimizing Tr tries 
to 'equalize power between as many modes as possible'. 
This in turn is motivated by our attempt to minimize the 
cosmic variance on iS™2 > leading to the most peaked like- 
lihood function (and hence best possible likelihood gains 
over the concordance theory). 

These statements can be made somewhat more math- 
ematically concrete, but we did not find a full proof that 
minimizing Tr minimizes the cosmic variance of 5™2 • 
Instead, Section |IVB| gave a strict lower bound on the 
variance of 5™2 1 and stated that our theory comes close 
to saturating this limit. The remainder of the present 
appendix explains the origin of such a variance fioor. 

We start by considering, for simplicity, the full sky 
Si/2. We also temporarily approximate the likelihood 
function as Gaussian. Both of these simplifications will 
be removed in due course; in particular, all of our numer- 
ical results use the exact likelihood. The variance of the 
Si/2 statistic may be written 

((5i/2)2) - (5i/2)2 = 4c^sVf^sc + 2Tr sV'^sV'^, (CI) 

where c is a v ector composed of the C^'s [as defined by 
equation (A9)], and V^** represents the full sky cosmic 
variance. 



TAlS _ 

^U' — 



2Tr CA^CA 

(2£+l)(2^' + l)' 



(C2) 



We wish to minimize Eq. (CI I with respect to C while 



keeping constant. Using a standard Lagrange multi- 
pier technique, one obtains 

^ s,,,,s,3,,(C,,C,, + V^:JA'^CA'^ = 0, (C3) 
where £2 ^ (3] and 

C^A^ = A^CA^. (C4) 
The most obvious solution to the minimization equations 



(C3) and (C4| is the isotropic one. 



(C5) 



One can verify that the so lutio n (C5 
maximum) of expression (CI) 



is a minimum (not 
To demonstrate that no 
other minima exist, consider the only alter nati ve to ( [C5| ): 
namely that A^^CA^^ ^ and equation (C3) is instead 



satisfied by making the numerical coefficient vanish. We 
consider the case where this is true for all £2, ^3 (^2 1^ ^3), 
but the ideas generalize straight-forwardly to the case 
with only limited numbers of non-zero off-diagonal terms. 
The most general solution is 



(C6) 
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where the are symmetric matrices which satisfy 
sQ's — (oiT-diagonal) and 



1 



(C7) 



The Q* may be found numerically using a singular value 
decomposition technique. 

Let us consider whether a ph ysic al (positive definite) 
solution to equations (C6| and (C7) exists. A necessary 
condition is that 



< 



V(2^+l)(2f + 1)' 



(C8) 



This condition is violated by equation ( C6 ) with = 



0, but can a suitable choice of remove the violation? 
There are far fewer matrices than degrees of freedom 
m yf", so that one would need a numerical coincidence 
to be able to remove the violation simultaneously at all 

1. We verified computationally that, for our choice of C^, 
this is indeed not possible. 

Now when the Gaussian simplification is abandoned, 
expression (CI ) picks up extra terms of the form 



S£i£2S£3£4Tr CA^iTr CA^^CA^^CA^* and (C9) 
s,,,,s,3,,Tr CA^lCA''-CA^-CA^^ (CIO) 



One may verify that with these terms, equation ( C5 ) re- 
mains a local minimum of the 5i/2 variance. Since there 
are no other valid local minima in the Gaussian approx- 
imation, we should not expect to find new local minima 
appearing in the non-Gaussian case (the simplification 
modifies only the skewness, not the width, of the likeli- 
hood). A full analytic calculation is prohibitive, but the 
numerical results quoted in the main paper use the full, 
non-Gaussian likelihood. 

Now let us consider the cut sky case. The de-biased 
PCL estimates c are related to the power spectrum of 
the cut sky c by 



(Cll) 



for the matrix M defined by equation ( |A18[ ). (The 'PCL' 
superscript is dropped for concision.) Because of the ex- 
act relation 



1/2 ~ 2^ ^i' 



PCL 



(see Appendix [b]) we may define 

s = (M-i)^sM^\ 



so that equation (C12) simplifies to 



(C12) 



(C13) 



(CM) 



The cut-sky reasoning then follows through exactly as for 
the full-sky case, except with s and C replaced by s and 
C respectively. For the lower bound theory one obtains 



w 



(C15) 



Clearly this ignores the implicit restrictions on C aris- 
ing from its status as a cut-sky, rather than full-sky, co- 
variance matrix. (Specifically, a valid C must live in 
the cut-sky subspace so that C = KCK.) However the 
set of all valid C is, crucially, a subset of the positive- 
definite matrices which were considered for the full-sky 
case. Therefore our test theory still gives a lower bound 
for the set of valid theories. 

To actually calculate the lower bound we draw 20 000 
sets of dim's according to the covariance matrix (C15) 



and, for each, calculate the power spectrum Ci and hence 
according to equation (C14). Calculating the vari- 
ance on this random sample leads to the numerical lower 
limit quoted in Section [iVB[ 

Finally note that, because M is almost diagonal, our 
Monte Carlo results are almost equivalent to those ob- 
tained by calculating S1/2 in an isotropic theory satisfy- 



ing 



(CI6) 



This is the justification for our intuitive explanation that 
the lower bound on the variance of iS™2 is given by 
the variance of S1/2 in an isotropic, full-sky theory with 
power spectrum equal to the cut-sky power spectrum of 
the designer theory. 



Appendix D: A numerical problem and solution 

There is a hidden numerical pitfall in the harmonic- 
space masking operation as defined by Eq. ( A2 1 . The 



matrix K is not band-limited, which means that trun- 
cating at finite £ produces cut-sky vectors a which retain 
some information about the data inside the cut. This 
garbled information is visible in maps as low-amplitude 
ringing around the edges of the cut. The QML estimator, 
in particular, is very efficient at regenerating the full sky 
from this trace of unwanted information. 

We investigated two methods of mitigating this prob- 
lem, both of which generated results in good agreement 
with pixel space techniques. The first is heuristic, sim- 
ply smoothing the input (full sky) and output (cut sky) 
maps to angular scales larger than 180°/£ijiax- 

The second, which we adopted for our final results^°, 
is to use an eigenvector decomposition. For a specified 



This method was suggested to us by S. Gratton. 
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sky fraction /, we calculate K to finite £max and find its 
eigenvalues and vectors. We then replace the smallest 
eigenvalues (specifically, a fraction / of the eigenvalues) 
by zero and all other eigenvalues by one. The final oper- 
ation is then guaranteed to be idempotent (unlike the ad 
hoc smoothing approach) and also discards exactly the 
right fraction of information from the input map. Visu- 
ally, we found the maps produced looked almost identi- 
cal to those masked in pixel space. As commented above, 
we verified that the final estimator results produced from 
a harmonic-space analysis were closely compatible with 
those produced from a pixel space analysis. The latter 
are slow and cumbersome [they cannot take advantage 
of simplification (A5|], but do not suffer from the band- 
limitation problem and therefore serve as a useful point 
of comparison. 



Appendix E: Rapid calculation of L: 



2 

max 



In the main text, we discussed the planarity of t = 2 
and ^ = 3 power in the observed CMB. This is uncovered 
[5] by computing the 'angular momentum dispersion' 



-2 _ Em"z2 \atm\'' 



(El) 



designed to detect 'planarity' of power. By maximiz- 
ing this quantity across different rotations of the sky, 
one produces a preferred direction in which the power 
is most planar. Frequentist anomalies are then reported 
if the sky-measured maximum values of have small 
P- values according to Monte Carlo simulations of statis- 
tically isotropic skies - or if the maximizing directions 
for two different Cs are coincident. 

We will show below that calculating Li^ after rotating 
the sky by Euler angles^^ (0 — 7r/2,— 6',0) is equivalent to 
forming the quantity 



(E2) 



where the vector n — (sin Q cos 0, sin Q sin 0, cos 6*) and the 
matrix elements L™™ are given explicitly below. Hence, 
for a given set of a^m's, the problem of maximizing 
reduces to finding the maximal eigenvector of the 3x3 
real symmetric matrix X^mm' ■^ii™''^^m"fon'- This algo- 
rithm is cheaper by orders of magnitude than numerical 
maximization methods that appear to have been used to 
date. 



We adopt the convention of |37l : an Euler rotation (a,/9,7) suc- 
cessively rotates the physical sky relative to the fixed, right- 
handed coordinate system by —a, — /3 and —7 around the z, 
X and z axes respectively. The final z rotation would not affect 
the value of L«, so we fix 7 = 0. 



We now prove relation ( E2 1 and give explicit forms for 
the matrix elements i™™ . Consider the spm-£ function 
which may be expanded as 



(E3) 



By considering the angular momentum of this function 
in the z direction, 



(e^ • J)*^ 



-id^-^i{e, 0) = ^ ma£„r<>,„(6'. 



(E4) 

where J is the fiducial angular momentum operator and 
is the unit vector in the z direction, one may rewrite 
equation (El ) as 



7^2 - 



(e..j)^ 



where the inner product is defined as usual: 
($1*) = / <^*{e,<h)^{e,4>) sin6ld6'c 



(E5) 



(E6) 



To convert expression ( E5 1 for a single Lif value into 



the form ( E2 1 , which is claimed to give all possible sky- 
rotated values, we first need to see that the following two 
operations are equivalent: 

(i) rotating the function by angle a about an axis spec- 
ified by the vector r, then measuring L^; 

(ii) rotating the z coordinate direction by angle —a 
about r while keeping the function fixed, i.e. re- 
placing Gz in equation ( E5 ) by the rotated vector 



The equivalence is intuitively clear because the only pre- 



ferred direction in expression ( E5 1 is given by the vec- 
tor. To establish the result more formally, one can use 
the commutation relations for the angular momentum 
operators applied to an infinitesimal rotation, and then 
extend, as usual, to finite rotations by exponentiation. 
The result is that 











JiJj 

















(E7) 

with n defined as above. In harmonic space the o perat or 
JiJj forms the matrix elements appearing in Eq. (E2): 



r mm 



= {Yi^'\jJj\Yi^) = Jr"'"jf'"'\ (E8) 



where the harmonic space angular momentum operators 
are obtained numerically using the relations 



jmm 

z 


= m 


^mm' ! 


jmm' 


1 

^ 2 


/ jmm' 

v+ 


jmm' 

Jy 


1 

~ 97 


I jmm 

V+ 



r 



r 



and 



(E9) 
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Here the matrix elements of the ladder operators j± obey 

J£""' = Vl{l + ^)-m{m+l)Sm±i,n.'- (ElO) 

Finally we note that analytic moments of the statisti- 
cal distributions may be calculated using the above for- 
malism; however, in practice, the distributions are rather 



asymmetrical at low £ and it is conceptually and compu- 
tationally easier to use Monte Carlo results - this is ex- 
tremely fast using the new algorithm, especially since the 
£^mm jnatrix elements can be precomputed and cached 
for each i of interest. 



